Syllables, Morphemes and Bayesian Computational Models of Acquiring a Word Grammar

نویسندگان

  • Çağrı Çöltekin
  • Cem Bozşahin
چکیده

We report a computational study on the CHILDES database for learning a word grammar of Turkish nouns. The syllable-based model converges to a morpheme-based model in terms of overlaps in the set of lexical hypotheses. Morphology is a hidden variable in all models, and the search problem for hypotheses is narrowed down by a probabilistic conception of universal grammar à la Combinatory Categorial Grammar. The convergence of the syllable model suggests that morphemehood can be an emergent computational property.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تکیه در زبان فارسی

Abstract: This research has been carried out in the framework of Auto segmental-metrical (AM) phonology to study the stress in Persian. Two types of abstract and concrete prominences were distinguished in which the first one refers to the stress and the second one refers to the pitch accent. Stress is assumed to be a lexical property of the lexemes, but pitch accent is assumed to be an intonati...

متن کامل

The changing status of 'filler syllables' on the way to grammatical morphemes.

The appearance of 'filler syllables' (called here PAEs, for Prefixed Additional Elements) in the late single-word period is analysed in relation to the emergence of grammatical morphemes, by confronting data from the longitudinal study of one child acquiring French, video-recorded between 1;3.2 and 2;2.6, with four hypotheses making different claims about the kind of language knowledge underlyi...

متن کامل

Accuracy Order of Grammatical Morphemes in Persian EFL Learners: Evidence for and against UG

This study addresses the acquisition of the morphological markers in Persian learners of English as a foreign language. To this end, the accuracy order of nine morphemes including plural –s, progressive –ing, copula be, auxiliary be, irregular past tense, regular past tense –ed, third person –s, possessive -ʼs and indefinite articles was studied in 6...

متن کامل

Probabilistic modelling of morphologically rich languages

This thesis investigates how the sub-structure of words can be accounted for in probabilistic models of language. Such models play an important role in natural language processing tasks such as translation or speech recognition, but often rely on the simplistic assumption that words are opaque symbols. This assumption does not fit morphologically complex language well, where words can have rich...

متن کامل

Syllable weight encodes mostly the same information for English word segmentation as dictionary stress

Stress is a useful cue for English word segmentation. A wide range of computational models have found that stress cues enable a 2-10% improvement in segmentation accuracy, depending on the kind of model, by using input that has been annotated with stress using a pronouncing dictionary. However, stress is neither invariably produced nor unambiguously identifiable in real speech. Heavy syllables,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007